Improvement of electrolaryngeal speech by introducing normal excitation information
نویسندگان
چکیده
In electrolaryngeal speech, an excitation signal is provided by means of a buzzer held against the neck which is usually operated at a constant frequency rate. While such Transcutaneous Artificial Larynges (TALs) provide a means for verbal communication for people who are unable to use their own, the monotone F0 pattern results in poor speech quality. In the present study, cepstral analysis was used to replace the original F0 contour of the TAL speech with a normal F0 pattern. Spectral analysis shows that this substitution results in two changes: (a) a varying F0 contour and (b) removal of steady background noise due to the leakage of acoustic energy. Perceptual tests were conducted to assess speech, before and after cepstral processing, produced by four laryngectomized speakers (2 males and 2 females). All speakers used the Servox TAL. The results indicate a clear preference for the processed speech.
منابع مشابه
An Evaluation of a Hybrid Approach to Electrolaryngeal Speech Enhancement Based on Noise Reduction and Statistical Excitation Prediction
An Evaluation of a Hybrid Approach to Electrolaryngeal Speech Enhancement Based on Noise Reduction and Statistical Excitation Prediction Kou TANAKA†, Tomoki TODA†, Graham NEUBIG†, Sakriani SAKTI†, and Satoshi NAKAMURA† † Graduate School of Information Science, Nara Institute of Science and Technology, 8916-5 Takayama-cho, Ikoma-shi, 630-0101, Japan E-mail: †{ko-t,tomoki,neubig,ssakti,s-nakamura...
متن کاملA Hybrid Approach to Electrolaryngeal Speech Enhancement Based on Noise Reduction and Statistical Excitation Generation
This paper presents an electrolaryngeal (EL) speech enhancement method capable of significantly improving naturalness of EL speech while causing no degradation in its intelligibility. An electrolarynx is an external device that artificially generates excitation sounds to enable laryngectomees to produce EL speech. Although proficient laryngectomees can produce quite intelligible EL speech, it s...
متن کاملEnhancement of electrolaryngeal speech by spectral subtraction, spectral compensation, and introduction of jitter and shimmer
An electrolarynx, a verbal communication aid used by laryngectomy patients, is a vibrator held against the neck tissue to provide excitation to the vocal tract, as a substitute to that provided by the glottal vibrations. Although the user can set the vibration level and pitch, a dynamic control of level, voicing, and pitch during speech production is not feasible. In addition to this basic limi...
متن کاملEvaluation of Excitation Feature Prediction in a Hybrid Approach to Electrolaryngeal Speech Enhancement
We implement removing micro-prosody with low-pass filtering and avoiding Unvoiced/Voiced (U/V) prediction as part of a hybrid approach to improve statistical excitation prediction in the hybrid approach to electrolaryngeal (EL) speech enhancement. An electrolarynx is a device that artificially generates excitation sounds to enable laryngectomees to produce EL speech. Although proficient larynge...
متن کاملPhysically Constrained Statistical F0 Prediction for Electrolaryngeal Speech Enhancement
Electrolaryngeal (EL) speech produced by a laryngectomee using an electrolarynx to mechanically generate artificial excitation sounds severely suffers from unnatural fundamental frequency (F0) patterns caused by monotonic excitation sounds. To address this issue, we have previously proposed EL speech enhancement systems using statistical F0 pattern prediction methods based on a Gaussian Mixture...
متن کامل